Cross-validation methods in principal component analysis: A comparison

نویسندگان

  • Giancarlo Diana
  • Chiara Tommasi
چکیده

In principal component analysis (PCA), it is crucial to know how many principal components (PCs) should be retained in order to account for most of the data variability. A class of "objective" rules for finding this quantity is the class of cross-validation (CV) methods. In this work we compare three CV techniques showing how the performance of these methods depends on the covariance matrix structure. Finally we propose a rule for the choice of the "best" CV method and give an application to real data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Empirical Comparison between Grade of Membership and Principal Component Analysis

t is the purpose of this paper to contribute to the discussion initiated byWachter about the parallelism between principal component (PC) and atypological grade of membership (GoM) analysis. The author testedempirically the close relationship between both analysis in a lowdimensional framework comprising up to nine dichotomous variables and twotypologies. Our contribution to the subject is also...

متن کامل

Studying Spatial Changes of Groundwater\'s Nitrate Content in Central District of Khodabandeh, Iran

Background: This research aims to measure and study spatial changes, and the reason behind the increasing nitrate content in water wells in the Central District of Khodabandeh County in the Zanjan Province. Methods: The nitrate and nitrite content, electrical conductivity, dissolved oxygen, temperature, total hardness and pH were measured at 40 sampling stations in the study area. The obtained...

متن کامل

An application of principal component analysis and logistic regression to facilitate production scheduling decision support system: an automotive industry case

Production planning and control (PPC) systems have to deal with rising complexity and dynamics. The complexity of planning tasks is due to some existing multiple variables and dynamic factors derived from uncertainties surrounding the PPC. Although literatures on exact scheduling algorithms, simulation approaches, and heuristic methods are extensive in production planning, they seem to be ineff...

متن کامل

Spectrophotometric resolution of ternary mixtures of Dexamethasone, Polymyxin B and Trimethoprim in synthetic and pharmaceutical formulations

Four spectrophotometric methods are described and applied to resolve ternary mixtures of the corticosteroid Dexamethasone (DEX), the antibiotic Polymyxin B (PLX) and its encouraging Trimethoprim (TMP). The simultaneous determination of these three compounds was firstly accomplished by a derivative method using the “ratio spectrum-zero crossing derivative” (satisfactory results in synthetic mixt...

متن کامل

Bootstrapping Principal Component Regression Models

Bootstrap methods can be used as an alternative for cross-validation in regression procedures such as principal component regression (PCR). Several bootstrap methods for the estimation of prediction errors and confidence intervals are presented. It is shown that bootstrap error estimates are consistent with cross-validation estimates but exhibit less variability. This makes it easier to select ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006